Provision of resource allocation information

ABSTRACT

A system is provided for disseminating resource allocation information from system resources to state-information observers comprising resource users and typically also at least one system resource manager. Each resource maintains state information about its identity and its allocation to one or more resource users. Each resource provides this information to a state-dissemination arrangement which disseminates it to each state-information observer. Each resource user uses the state information it receives from the state-dissemination arrangement to ascertain the resources allocated to it. Similarly, a system resource manager, when present, uses the state information it receives from the state-dissemination arrangement to ascertain the allocation of those resources that are of interest to the manager. A resource, resource user and resource manager for use in such a system are also provided.

FIELD OF THE INVENTION

The present invention relates to the provision of resource allocation information to entities of a processing system, and to a resource entity, resource user entity and resource manager entity for use in such a system.

BACKGROUND OF THE INVENTION

Computer systems can be viewed as containing resource entities of various types that are used by resource user entities to provide a particular service. Typical resource entities (or simply ‘resources’) include entities for running programs, storing data, providing communication, or performing some other function, such as encryption/decryption. A resource user entity (or more simply ‘resource user’) can, for example, be constituted by an application program providing a particular service.

Typically, multiple resource users will exist concurrently in a computer system with the current population of resource users changing over time according to the needs of human end users; this is particularly the case if the computer system is a very large facility such as a data center. Also, from time to time a resource will fail to operate correctly and need to be replaced; conversely, a resource user may fail, effectively freeing up the resources it was using. For the foregoing reasons, the allocation of resources to resource users needs to change over time and this must be managed appropriately. In particular, resources must be allocated in way which ensures that each resource user is aware of the resources that have been allocated to it and can use them, and that the system does not lose track when failures occur. The role of managing resource allocation is carried by one or more resource managers; of course, where multiple resource managers are used, the problem of coordinating resource allocation becomes even harder.

Past attempts to solve this problem have typically relied on an inventory or a similar representation to maintain information about the resources and their allocation.

It is an object of the present invention to provide a way of managing resource allocation that facilitates the provision of allocation information to entities that require to know such information.

SUMMARY OF THE INVENTION

According to a first aspect of the present invention, there is provided a system comprising:

-   -   a plurality of resources each arranged to maintain and provide         state information about its allocation to one or more resource         users and its identity;     -   a state-dissemination arrangement for disseminating the state         information provided by the resources; and     -   at least one state-information observer arranged to receive         state information from the state-dissemination arrangement, said         at least one state-information observer comprising at least one         resource user arranged to use the state information it receives         to ascertain which of the resources, if any, have been allocated         to it

Typically, the said at least one state-information observer further comprises at least one resource manager arranged to receive state information from the state-dissemination arrangement whereby to ascertain the allocation of the resources of interest to the manager.

The state-dissemination arrangement can be arranged to deliver the state information provided by all the resources to every state-information observer. Preferably, however, the or each state-information observer is arranged to register with the state-dissemination arrangement to indicate its interest in particular state information, and the state-dissemination arrangement is arranged to use these registered interests to manage the dissemination of state information.

In one preferred embodiment, the state-dissemination arrangement includes communication timing means for monitoring the communication time taken to disseminate information from a resource to the or each state-information observer that wishes to receive state information from it, the communication timing means being arranged to cause the or each such state-information observer to be informed, upon the monitored communication time for disseminating information to it from the resource concerned exceeding a predetermined time value, that state information for the resource is no longer available. In this case, each state-information observer can assume that any resource and resource allocation it observes is correct to within the aforesaid predetermined time limit. A resource manager can assume that any resource allocation it observes is either observed, or its absence is observed, by all interested resource users and other resource managers, if any, within the predetermined time limit. This level of consistency allows a resource manager to know allocations do not conflict.

Advantageously, the state-dissemination arrangement further includes partition means for identifying non-overlapping collections where each collection comprises at least one resource and at least one state-information observer between all of which state information can be disseminated within said predetermined time limit as monitored by the communication timing means; the at least one state-information observer of a collection being arranged to take account of state information only from resources within the same collection; and the state-dissemination arrangement being further arranged to inform the state-information observers of a collection of any disruption to collection membership whereby each such state-information observer knows that it cannot rely upon the receipt, by interested state-information observers of the collection, of any item of state information which the state-information observer itself has received within an immediately preceding time period of duration corresponding to twice said predetermined time limit. In this case, each state-information observer in a collection can assume that any resource and resource allocation it observes is also observed by all other interested state-information observers in the same collection within the aforesaid predetermined time limit and is not observed by any state-information observer outside its collection. This level of consistency allows multiple resource managers in a collection to take coordinated actions without requiring additional direct communication. In addition resource managers that are partitioned from each other can coordinate with each other in respect of certain actions involved in a partition change.

In terms of its constituent entities, a preferred embodiment of a system according to the present invention comprises:

-   -   a resource entity arranged to maintain state information about         its allocation to one or more resource users and its identity,         and to provide this information, at least upon a change of         allocation of the resource entity, to the state dissemination         arrangement whereby to enable resource user entities to         ascertain whether they have been allocated the resource entity;     -   a resource user entity arranged to receive from the state         dissemination arrangement state information that has been         provided by at least one resource entity and comprises         information about the allocation of the resource entity to one         or more resource users and the identity of the resource entity,         the resource user entity being arranged to use the received         state information to ascertain which resources have been         allocated to it; and     -   a resource manager entity arranged to receive from the state         dissemination arrangement state information that has been         provided by at least one resource entity and comprises         information about the allocation of the resource entity to one         or more resource users and the identity of the resource entity,         the resource manager entity being arranged to use the received         state information to ascertain the allocation of resources of         interest to it; and the resource manager being further arranged         to output allocation messages to set the allocation of the or         each resource entity of interest to it.

Each of these entities individually embodies aspects of the present invention.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention will now be described, by way of non-limiting example, with reference to the accompanying diagrammatic drawings, in which:

FIG. 1 is a diagram illustrating the general operation of a state-dissemination service employed in embodiments of the invention;

FIG. 2 is a diagram of a distributed system with multiple processing nodes each including a state-dissemination server;

FIG. 3 is a diagram of a first form of state-dissemination server usable in the FIG. 2 system;

FIG. 4 is a diagram illustrating local register tables maintained by a state manager of the FIG. 3 state-dissemination server;

FIG. 5 is a diagram illustrating global register tables maintained by a state manager of one of the state-dissemination servers of the FIG. 2 system;

FIG. 6 is a diagram illustrating enhancements to the form of state-dissemination server shown in FIG. 3; and

FIG. 7 is a diagram illustrating the use of the FIG. 2 system in disseminating resource allocation information.

BEST MODE OF CARRYING OUT THE INVENTION

The embodiments of the invention to be described hereinafter are based on the dissemination of state information about an entity of a system from that entity to other entities of the system. FIG. 1 depicts the general operation of such a state-dissemination service. More particularly, FIG. 1 shows three entities 10, 11, and 12 each of which has access to a state-dissemination service 15. The entity 11 has state information that it is willing to share with other entities 10, 12; accordingly, the entity 11 provides its state information to the state-dissemination service 15, this typically being done each time the information changes in any way. The state-dissemination service 15 is then responsible for providing the state information concerning entity 11 to the entities 10 and 12.

The state-dissemination service 15 can be arranged simply to supply the state information it receives from any entity to every other entity; however, preferably, each entity that wishes to receive state information registers a state-information indicator with the state-dissemination service 15 to indicate the particular state information in which it is interested in receiving. This indicator could, for example, simply indicate that the registering entity wants to receive all state information provided by one or more specified other entities; alternatively, the indicator could indicate the identity of the particular state information that the registering entity wants to receive regardless of the entity providing it. In this latter case, when state information is provided by an entity to the state-dissemination service 15, the providing entity supplies a state-information identifier which the service 15 seeks to match with the indicators previously registered with it; the provided state information is then passed by the state-dissemination service to the entities which have registered indicators that match the identifier of the provided state information.

Rather than this matching being effected by the state-dissemination service 15 at the time the state information is provided to it, entities that intend to provide state information to the service 15 are preferably arranged to register in advance with the service to specify state-information identifier(s) for the state information the registering entity intends to provide; the state-dissemination service 15 then seeks to match the registered identifiers with the registered indicators and stores association data that reflects any matches found. The association data can directly indicate, for each registered identifier, the entities (if any) that have registered to receive that information; alternatively, the association data can be less specific and simply indicate a more general pattern of dissemination required for the state information concerned (for example, where the entities are distributed between processing nodes, the association data can simply indicate the nodes to which the state information should be passed, it then being up to each node to internally distribute the information to the entities wishing to receive it). The association data is updated both when a new identifier is registered and when a new indicator is registered (in this latter case, a match is sought between the new indicator and the registered identifiers).

When an entity subsequently provides state information identified by a state-information identifier to the state-dissemination service, the latter uses the association data to facilitate the dissemination of the state information to the entities that have previously requested it by registering corresponding state-information indicators.

As will be more fully described below, where the entities are distributed between processing nodes, the state-dissemination service is preferably provided by an arrangement comprising a respective state-dissemination server entity at each node. In addition, where the state-dissemination service operates by generating association data from supplied state-information identifiers and indicators, preferably not only are the state-information identifiers and indicators associated with the entities at each node recorded in registration data held by that node, but the association data concerning the state-information identifiers registered by the node entities of that node is also stored at the node. Furthermore, each node preferably stores source data indicating, for each state-information indicator registered by the entities of that node, the origin of the corresponding state information. As will be explained hereinafter, by arranging for this local storage of registration data, association data and source data, a relatively robust and scalable state-dissemination service can be provided.

FIG. 2 shows an example distributed system with multiple processing nodes 20, 21 and 22 arranged to intercommunicate via any suitable communication arrangement here shown as a network 23. Node 20 includes entities 24, 25 and 26, whilst node 21 includes entity 27 and node 22 includes entities 28 and 29.

The FIG. 2 system operates a state-dissemination service provided by a state-dissemination arrangement comprising a respective state-dissemination (SD) server 50A, 50B and 50C at each node 20, 21 and 22; the SD servers are arranged to communicate with each other via the network 23.

Each one of the entities 24 to 29 that intends to provide state information to the state-dissemination service is arranged to register a corresponding state-information identifier with the local SD server 50 (that is, with the SD server at the same node). To this end, each such entity instantiates a software “state provider” object P (generically referenced 40) and passes it the identifier of the state information to be provided to the state-dissemination service. The state provider object 40 is operative to the register itself and the state-information identifier with the local SD server 50 and the latter stores this registration data in a local register 61; the state provider object 40 is also operative to subsequently provide instances of the identified state information to the SD server.

Similarly, each one of the entities 24 to 29 that wishes to receive particular state information from the state-dissemination service is arranged to register a corresponding state-information indicator with the local SD server 50 (that is, with the SD server at the same node). To this end, each such entity instantiates a software “state listener” object L (generically referenced 41) and passes it the indicator of the state information to be provided by the state-dissemination service. The state listener object 41 is operative to register itself and the state-information indicator with the local SD server 50 and the latter stores this registration data in the local register 61; the state listener object 41 is also operative to subsequently receive the indicated state information from the SD server.

It will be appreciated that the use of software state provider and listener objects 40 and 41 to interface the entities 24 to 29 with their respective SD servers 50 is simply one possible way of doing this.

In the present example, regarding the provision of state information:

-   -   Entity 24 of node 20 is arranged to provide state information         identified by state-information identifier ‘S1’ to which end the         entity instantiates state provider 40A which registers itself         and the identifier S1 with SD server 50A;     -   Entity 26 of node 20 is arranged to provide state information         identified by state-information identifier ‘S2’ to which end the         entity instantiates state provider 40B which registers itself         and the identifier S2 with SD server 50B; and     -   Entity 29 of node 22 is arranged to provide state information         identified by state-information identifier ‘S3’ to which end the         entity instantiates state provider 40C which registers itself         and the identifier S3 with SD server 50C; Regarding the receipt         of state information:     -   Entity 24 of node 20 is interested in receiving state         information indicated by state-information indicator ‘S3’ to         which end the entity instantiates state listener 41A which         registers itself and the indicator S3 with SD server 50A;     -   Entity 25 of node 20 is interested in receiving state         information indicated by state-information indicator ‘S1’ to         which end the entity instantiates state listener 41B which         registers itself and the indicator S1 with SD server 50A;     -   Entity 27 of node 21 is interested in receiving state         information indicated by either one of state-information         indicators ‘S2’ and S3’, to which end the entity instantiates         corresponding state listeners 41C and D each of which registers         itself and the indicator S2 and S3 respectively with SD server         50B; and     -   Entity 28 of node 22 is interested in receiving state         information indicated by any one of state-information indicators         ‘S1’, ‘S2’ and S3’, to which end the entity instantiates         corresponding state listeners 41E, F, and G each of which         registers itself and the indicator S1, S2 and S3 respectively         with SD server 50C.

The data registered by the or each state provider and/or listener associated with a particular node constitutes registration data and is held by the SD server of that node.

In this example, it can be seen that the same state-information labels S1, S2, and S3 have been used for the state-information identifiers and indicators; in this case, the matching of identifiers and indicators carried out by the state-dissemination service simply involves looking for a full match between an identifier and indicator. However, using exactly the same identifiers and indicators is not essential and matching based on parts only of an identifier and/or indicator is alternatively possible (for example, the state-dissemination service can be arranged to determine that a state-information indicator ‘abcd’ is a match for a state-information identifier ‘abcdef’). Furthermore, although not illustrated in the FIG. 2 example, an entity can be arranged to provide the same state information under several different identifiers; in the present case, this involves instantiating a respective state provider for each identifier. In addition, as well as more than one state listener registering the same state-information indicator as illustrated in FIG. 2, more than one state provider can register the same state-information identifier.

The state-dissemination service provided by the SD servers 50A-C is arranged to derive association data and source data from the registered state-information identifiers and indicators. In the present case, the association data is used to indicate, for each state-information identifier, the SD server(s) where corresponding indicators have been registered; the source data is used to indicate, for each state-information indicator, the SD server(s) where corresponding identifiers have been registered (of course, the source data can also be considered to be a form of association data, however, the term ‘source data’ is used herein to distinguish this data from the above-mentioned data already labelled with the term ‘association data’). For each identifier, the corresponding association data is held by the SD server where the identifier is registered; similarly, for each indicator, the corresponding source data is held by the SD server where the indicator is registered. As will be more fully explained below with reference to FIGS. 3 to 5, the association data and source data are determined in the present example by making use of a global register 91, maintained by one of the SD servers, that records the SD server(s) where each identifier and indicator has been registered. The global register 91 is only used for compiling the association data and source data and its loss is not critical to the dissemination of state information in respect of previously registered state-information identifiers and indicators already taken account of in the association data held by operative SD servers; furthermore, the contents of the global register can be reconstituted from the registration data held by the operative SD servers.

FIG. 3 shows in more detail one implementation of the SD servers 50 of the FIG. 2 system. The SD server 50 shown in FIG. 3 comprises a state manager functional block 51 and a communications services functional block 53, the latter providing communication services (such as UDP and TCP) to the former to enable the state manager 51 to communicate with peer state managers of other SD servers. The state manager 51 comprises a local registry 60, an outbound channel for receiving state information from a local state provider 40 and passing this information on to other SD servers 50 as required, and an inbound channel 80 for distributing state information received from other SD servers 50 to interested local listeners 41. The state manager of one of the SD servers also includes a global registry; all SD servers have the capability of instantiating the global register and the servers agree amongst themselves by any appropriate mechanism which server is to provide the global registry. The registry is not shown in the state manager 51 of FIG. 3 but is separately illustrated in FIG. 5

The local registry 60 comprises the local register 61 for holding the registration data concerning the local entities as represented by the local providers 40 and listeners 41, the association data for the state-information identifiers registered by the local providers 40, and source data for the state-information indicators registered by the local listeners 41. As depicted in FIG. 4, the local register 61 is actually organised as two tables, namely a local provider table 95 and a local listener table 66.

In the local provider table 65, for each identifier registered by a local provider 40, there is both a list of the or each local provider registering that identifier, and a list of every SD server, if any, where a matching state-information indicator has been registered. Table 65 thus holds the registration data for the local providers 40 and their associated identifiers, along with the association data concerning those identifiers.

In the local listener table 66, for each indicator registered by a local listener 41, there is both a list of the or each local listener registering that indicator, and a list of every SD server, if any, where a matching state-information identifier has been registered. Table 66 thus holds the registration data for the local listeners 41 and their associated indicators, along with the source data concerning those indicators.

With respect to the global registry 90 (FIG. 5), this comprises a global register 91 holding both a provider table 95 and a listener table 96. The provider table 95 lists the state-information identifiers that have been notified to it and, for each identifier, the or each SD server where the identifier is registered. The listener table 96 lists state-information indicators that been have notified to it and, for each indicator, the or each SD server where the indicator is registered.

When a local provider 40 is first instantiated, a registration/deregistration functional element 42 of the provider 40 notifies the local registry 60 and the registration process proceeds as follows:

-   (a) A functional element 62 of the registry 60 checks if the     state-information identifier associated with the new provider is     present in provider table 65 if not, a new entry is added. The     functional element 62 then adds the identity of the new provider to     the entry for the associated identifier in the provider table 65. -   (b) If a new entry had to be created in table 65 for the identifier     associated with the new provider, then the following operations are     effected:     -   (i) The functional element 62 sends an identifier registration         message including the registration details to the global         registry 90 by using the communication services provided by         block 53.     -   (ii) A functional element 92 of the global registry 90 effects         the following operations upon receipt of the identifier         registration message at the global registry:         -   A check is first made as to whether the identifier concerned             is already present in the provider table 95 and, if so, the             identity of the SD server from which the identifier             registration message was sent is added to the list of             servers associated with the existing entry for the             identifier; if there is no existing entry for the identifier             in table 95, a new entry is created and the identity of the             SD server from which the just-received message was sent is             made the first entry in the list of servers associated with             the new entry.         -   Matches are sought between the identifier in the identifier             registration message and the state-information indicators in             the listener table 96. A list of the SD servers associated             with any matches found (the ‘listener SD servers’) is then             returned in an association-data update message to the local             registry 60 which sent the identifier registration message.     -   (iii) The SD-server list returned in the association-data update         message to the local registry 60 of the SD server that         originated the identifier registration message, is received by a         functional element 64 which then updates the association data         held in the local provider table 65 of register 61 in respect of         the identifier concerned, by adding the listener SD servers in         the association-data update message to the list of listener SD         servers for that identifier.

In a similar manner, when a local listener 41 is first instantiated, a registration/deregistration functional element 43 of the listener 41 notifies the local registry 60 and the registration process proceeds as follows:

-   (a) A functional element 63 of the registry 60 checks if the     state-information indicator associated with the new listener is     present in listener table 66 if not, a new entry is added. The     functional element 63 then adds the identity of the new listener to     the entry for the associated indicator in the listener table 66. -   (b) If a new entry had to be created in table 65 for the identifier     associated with the new provider, then the following operations are     effected:     -   (i) The functional element 63 sends an indicator registration         message including the registration details to the global         registry 90 by using the communication services provided by         block 53.     -   (ii) A functional element 93 of the global registry effects the         following operations upon receipt of the identifier registration         message at the global registry:         -   A check is first made as to whether the indicator concerned             is already present in the listener table 96 and, if so, the             identity of the SD server from which the indicator             registration message was sent is added to the list of             servers associated with the existing entry for the             indicator; if there is no existing entry for the indicator             in table 96, a new entry is created and the identity of the             SD server from which the just-received message was sent is             made the first entry in the list of servers associated with             the new entry.         -   Matches are sought between the indicator in the indicator             registration message and the state-information identifiers             in the provider table 95. Each of the SD servers associated             with any matches found (the ‘provider SD servers’) is then             sent an association-data update message including the             identity of the SD server that originated the registration             message and the relevant identifier(s) found to match the             newly registered indicator.     -   (iii) At each SD server that receives an association-data update         message, the functional element 64 updates the association-data         held in the local provider table 65 of register 61 by adding the         SD server included in the association-data update message to the         list of listener SD servers for the or each identifier         referenced in the message.

With regard to the updating of the source data held in the local listener table 66 of each SD server 66 in response to the registration of a new provider 40 or listener 41, this is effected by the inbound channel 80 of each SD server when it receives state information in respect of an identifier that the registry 60 finds is a match for one or more state-information indicators in the table 66 (the handling of newly-received state information by the state manager 60 is described more fully below)

Rather than a newly registered listener having to wait for a change in state information for which it has registered before receiving that state information, provision can be made for providers of this information to send the current version of the state information of interest to the listener concerned (either by a dedicated exchange of messages or by the provider(s) being triggered to re-send their information via the state-dissemination arrangement).

The deregistration of a provider 40 or listener 41 is effectively the reverse of registration and involves the same functional elements as for registration. The main difference to note is that an identifier/indicator deregistration message is only sent from the local registry 60 to the global registry 90 if a state-information identifier or indicator is removed from the local provider table 65 or local listener table 66 (which is done when there ceases to be any associated provider or listener respectively).

In normal operation, upon an entity detecting a change in state information for which it has a provider 40 registered with its local register 60, a functional element 44 of the provider notifies the outbound channel 70 of the local register that there is new state information in respect of the state-information identifier concerned. A functional element 72 of the outbound channel 70 then looks up in the local provider table 65 of the register 60, the association data for the identifier in order to ascertain the SD servers to which the new state information needs to be sent; the new state information is then distributed, together with its identifier, to these servers by functional element 74. This distribution will typically involve use of the communication services provided by block 53; however, where a local listener 41 (that is, one at the same node) has registered to receive the state information, then the functional element 74 simply passes it to the inbound channel 80 of the same server (see arrow 77 in FIG. 3).

When an SD server 50 receives new state information, identified by a state-information identifier, from another SD server, it passes the information to the inbound channel 80 of the state manager 51. Upon new state information being received at the inbound channel 80 (whether from another SD server or from the local outbound channel), a functional element 82 of the inbound channel uses the identifier associated with the new state information to look up in the local listeners table 66 the listeners that have registered state-information indicators that match the identifier. The functional element 82 also checks that the SD server that sent the state information is in the list of provider SD servers for each matched indicator, if this is not the case, the list is updated (thereby updating the source data for the indicator concerned). A functional element 84 of the inbound channel is then used to distribute the received state information to the matched listeners 41 where it is received by respective functional elements 45 of the listeners.

As so far described, the state-dissemination arrangement of the FIG. 2 system provides a basic state-dissemination service (in fact, for this basic service, the source data and the functional elements that handle and use it are not required). This basic state-dissemination service only permits certain limited assumptions to be made by entities using the service; thus, an entity that has registered to receive particular state information can only assume that any version of this information that it observes has existed at some stage, but cannot assume that other entities registered to receive the information have also observed the same information.

As will be described below with reference to FIG. 6, the basic state-dissemination arrangement is preferably enhanced to provide better consistency properties for the state information it disseminates. More particularly, two enhanced forms of state-dissemination arrangement are described:

-   -   in the first enhanced form (herein referred to as the “TSD”         arrangement) connection-timing functionality 56 is added to the         communications services functional block 53 of each SD server 50         to provide the overall arrangement with the properties of a         fail-aware timed asynchronous system, and     -   in the second enhanced form (herein referred to as the “TPSD”         arrangement) in addition to the connection-timing functionality,         a partition manager 52 is inserted between the state manager 51         and the communications services block 53 of each SD server to         divide the state-dissemination arrangement into partitions. A         partition is a collection of entities in a system that can all         pass state information to one another within a given time limit.         If two entities cannot pass state information between one         another within the time limit they cannot be in the same         partition. All entities exist in exactly one partition.

It may be noted that, for present purposes, any internal time delays in a node in passing state information received by an SD server to a listener or in notifying it that the information is no longer available, can be discounted. The communication timings between SD servers are therefore taken as being representative of the communication timings between entities (more specifically, between providers and matched listeners).

Considering first the TSD arrangement, the connection-timing functionality 56 added to the communications services block 53 comprises a respective timed-connection functional element 57 for checking the timing of communication between every other SD server and the subject SD server. This check involves checking that communication is possible between every other SD server and the subject server within a predetermined time value (for example, 3 seconds). T this end, every SD server is provided with a heartbeat message function 58 which broadcasts periodic messages, identifying the originating SD server, to every other server; this broadcast is, for example effected using the UDP service provided by the block 53. When an SD server receives such a heartbeat messages it passes it to the timed-connection functional element 57 associated with the server that originated the heartbeat message. This functional element 57 thereupon resets a timer that was timing out a period equal to the aforesaid predetermined time interval. Provided this timer is reset before time out, the connection with the corresponding server is considered to be timely. The interval between heartbeat messages is such that several such messages should he received by an associated timed-connection functional element 57 over a period equal to the predetermined time value so that it is possible for a heartbeat message to be missed without the corresponding timer timing out.

In the event that the timer of a timed-connection functional element 57 times out, the state manager 51 of the same SD server is notified that timely communication with the server associated with that functional element 57 has been lost. The state manager 51 then uses the source data held in the local register 61 to determine which of the local listeners 41 were registered to receive state information from the SD server with which timely communication has been lost; these listeners are then informed that state information is no longer available from this server.

The heartbeat messages broadcast by a SD server 50 also enables a new SD server to announce itself to the existing SD servers, the connection timing function 56 of each existing SD server being arranged to listen out for broadcast heartbeat messages from new SD servers and to instantiate a new timed-connection functional element 57 for each such server detected.

It will be appreciated that the above described way of checking communication timing is simply one example of how to carry out this task and many other ways are possible, for example, by the use of round trip timing or by time-stamping one-way messages using synchronized clocks at all SD servers.

The operational messages passed between the SD services (such as those used to distribute state information) are, in the present example, sent on a point to point basis using the TCP service provided by block 53. These messages are preferably also used for checking communication timing, temporarily substituting for the heartbeat messages.

The enhanced state-dissemination service provided by the TSD arrangement ensures that listeners only receives timely information. Furthermore, a state listener can assume that all other state listeners with an equivalent matching indicator will either see the same state information from a given provider within the aforesaid predetermined time limit or are notified that there is no such state information within the same time limit.

Considering next the TPSD arrangement, the partition manager 52 that is interposed between the communication services block 53 and the state manager 51 in each SD server, implements a partition membership protocol and a leader election protocol. Suitable implements of such protocols will be apparent to person skilled in the art so only a brief description is given here.

The partition manager 52 uses three conceptual views of the SD servers that are participating in the state-dissemination service, each view being determined locally. The first, the connection set, is the set of connections between the subject SD server and other SD servers identified by the communication services block 53. The second view, the connection view 54, is derived directly from the connection set and represents SD servers that are potential members of a partition including the subject SD server. All SD servers in the connection set are admissible to the connection view 54, except those that are untimely or have recently been untimely. All partition managers 52 communicate their connection views 54 to each other whenever these views change, so each SD server has a copy of the connection view derived by every node in its own connection view—the fact that these connections are timely guarantees that the exchanges of connection views are timely.

The collection of connection views 54 known to the partition manager 52, including its own view, are used to derive the partition including the subject SD server. A partition manager 54 is said to be stable when its collection of connection views remain unchanged and they all agree (i.e. they are all the same). When stable, the partition manager 54 sets the partition 55 to be the same as the local connection view. When unstable, the partition manager 54 reduces the partition by selectively evicting SD servers according to the changes. Each partition manager 54 derives its own partition, but the sharing of connection views and the function used to derive the partition provide the following properties:

-   -   1. If a partition manager is stable and its partition is P, then         all partitions derived elsewhere are either subsets of P or do         not intersect P.     -   2. If two partition managers are stable and their partitions are         P and Q, then either P equals Q or P does not intersect Q.     -   3. If a partition manager is continuously stable between times         t−Δ and t and its partition is P, then each node in P is stable         at time t−Δ and has the same partition (here Δ is the aforesaid         predetermined time limit).

The second property is actually derived from the first, if two partitions are subsets of each other then clearly they are the same, and so these two actually represent one property. The second property is stated to emphasise the point that the partition managers either converge on the same partition or distinctly different partitions—they do not overlap. As a result, by the time one partition manager stabilizes, all SD servers that are excluded from its partition know that they are excluded; or rather they derive their own partition that does not intersect it. The third property demonstrates that if the partition remains stable then all SD servers will figure this out.

The leader election protocol operates similarly to the partition protocol. As well as exchanging connection views 54 the partition managers 52 exchange leader candidates. Each manager re-evaluates its choice of leader when connection view changes occur in such a way that they all chose the same leader. Conveniently, the leader SD server provides the global registry 90.

By arranging for each SD server 50 only to send registration messages to the global registry 90 of the same partition 55, the state listeners 41 only see state information from state providers 40 that are in the same partition as them.

The enhanced state-dissemination service provided by the TPSD arrangement enables a state listener to assume that all other state listeners with equivalent matching indicators are either in the same partition and see all the same state information within the given predetermined time limit or they are not in the same partition and do not see any of the same state information within the same time limit.

Listeners are informed by the SD servers when the partition has become unstable. If a provider provides state information s at time t to the TPSD service, then provided the partition remains stable, all interested listeners will receive the information s by time t+Δ. All such listeners can each then know by time t+2Δ that all other interested listeners have received the information s because it will be aware by this time of any disruption of the partition that would have prevented another interested listener from receiving the information by the time t+Δ.

Put another way, whenever an entity is informed by its local SD server that the partition of which it is a member is no longer stable, such an entity knows that it cannot rely upon the receipt by interested entities of the partition, of any item of lifecycle-state information which the entity itself has received within an immediately preceding time period of duration corresponding to 2Δ.

It may be noted that the TPSD service has the effect of partitioning the totality of state information knowledge. When the partitions are stable, two entities either have access to the same knowledge partition or non-overlapping knowledge partitions. So, whatever state information the entities are interested in knowing, even if these are completely different items of state information, will be consistent. Thus, if a first entity knows state information s by time t+Δ, then at time t+2Δ this entity knows that whatever state information a second entity knew by time t+Δ, is consistent with information s, whether it be the information s or something else all together.

The basic and enhanced state-dissemination arrangements described above, including all the variants mentioned, are well suited for use in disseminating resource allocation information between entities of a system including resources, resource users and one or more resource managers.

FIG. 7 illustrates the use of the FIG. 2 system in disseminating resource allocation information. In particular, the entities 24, 26 and 29 are resources, the entities 25 and 27 are resource users, and the entity 28 is a resource manager. It should, however, be noted that the set of state providers 40 and listeners 41 registered by these entities in the FIG. 7 system is different to that of FIG. 2.

The resource manager 28 is made aware in any suitable manner of the resource needs of the current resource users 25, 27 in the system (for example, the resource users can be arranged to send resource requests to the resource manager directly). The resource manager 28 includes a resource controller 100 that decides which resources are to be allocated to which resource users and then notifies each resource of its allocation (for example, by means of a message sent point-to-point over the network 23). In the present case, the resource manager makes the following allocations:

-   -   resource 24 is allocated to resource user 25     -   resource 26 is allocated to resource user 27     -   resource 29 is allocated to resource user 27

Each resource 24, 26 and 29 always has registered a respective provider 40D, 40F, 40H in respect of the same state-information identifier “SG”, each resource being arranged to provide under this identifier, allocation state information including the identity of the resource and its current allocation. In addition, each resource 24, 26 and 29 is arranged to register a further respective provider 40E, 40G, 401 upon the resource being allocated to a resource user, this further provider being registered in respect of a state-information identifier associated with the resource user to which the resource has been allocated. Thus, the provider 40E is registered in respect of an identifier S25 associated with the resource user 25, and the providers 40G and 401 are registered in respect of an identifier S27 associated with the resource user 27. The providers 40E, 40G and 401 are respectively arranged to provide the aforesaid allocation state information of the resource of which they form a part.

Each resource user 25 and 27 always has registered a respective state listener 41H, 41I in respect of a state-information indicator corresponding to the state-information identifier associated with the resource user. Thus the listener 41H is registered in respect of indicator S25, and listener is registered in respect of indicator S27.

The resource manager always has registered a state listener 41J in respect of a state-information indicator corresponding to the state-information identifier SG.

As a result of this configuration of providers and listeners, any change in the allocation of a resource will result in the allocation state information of that resource being sent from the resource's provider associated with the identifier SG to the corresponding listener of the state manager; the latter is therefore always kept aware of the current allocation of the resources even if it was not responsible for that allocation. Furthermore, each resource user will be passed any allocation information concerning a change of allocation in a resource allocated to it (thus, upon allocation of a resource to a resource user, the resource user is notified of this as soon as the resource has registered the appropriate provider; conversely, when a resource is removed from a resource user, the resource user is first notified before the involved provider of the resource concerned is de-registered by the latter).

In the foregoing example the state manager was arranged to receive allocation state information by registering a single listener in respect of a generic indicator corresponding to a generic identifier SG used by all resources. As already mentioned the correspondence between indicator and identifier can be based on matching portions only of each rather than requiring a full match. Furthermore, generic identifiers and corresponding indicators can be used in respect of sub-groups of resources such as resources of a particular type; in this manner one resource manager can be made responsible for resources of one type and another resource manager responsible for resources of a different type, each resource type being identified by a respective type-generic state-information identifier for which the corresponding manager registers a corresponding indicator.

The resource users and managers observe the following consistency properties depending the form of the state-dissemination arrangement used:

-   -   If the basic state-dissemination arrangement is used, a resource         user or manager can only assume that it can discover resources         and resource allocations that have existed. If a resource         manager changes a resource allocation and it observes the         change, it cannot assume that any other resource manager or         resource user will ever observe the change.     -   If the T SD arrangement is used, a resource user or manager can         assume that any resources and resource allocations it observes         are correct to within the aforesaid predetermined time limit. A         resource manager can assume that any resource allocation it         observes is either observed, or its absence is observed, by all         other resource managers and users within the predetermined time         limit. This level of consistency allows a resource manager to         know allocations do not conflict. For example, a resource         manager can withdraw a resource from one resource user and then         after a delay equal to the predetermined time limit allocate it         to another resource user, knowing that the resource users will         not believe they both own the resource at the same time.     -   If a TPSD arrangement is used, a resource user or manager can         assume that: any resource and resource allocation it observes is         also observed by all other interested resource users and         managers in its partition within the predetermined time limit         (though the resource user or manager can only rely on this after         twice the time limit); and is not observed by any resource user         or manager outside its partition. This level of consistency         allows resource managers to take coordinated actions without         requiring additional direct communication. For example, assume         that there is a resource requirement for a database server that         must be in the same partition as a web server. If a network         failure results in the database and web server resources being         in different partitions, then a resource manager local to the         database server will observe the change and withdraw the         database server resource (possibly terminating the database) and         a resource manager local to the web server can calculate a time         by which it can safely allocate a resource for a new database.         The result is that two resource managers that cannot         communication can guarantee that no two resources are allocated         for the database server role at the same time.

As will be apparent from the state-dissemination arrangements described above, the allocation status of a resource is maintained by the resource itself rather than in some external entity such as the state-dissemination arrangement. A small efficiency improvement may, however, be obtained if an SD server caches the last state information it receives from each local provider 40 as this enables it to respond, without consulting the provider concerned, to a request for the state information for provision to a newly registered listener interested in that information.

It will be appreciated that many variants are possible to the above described embodiments of the invention. For example, the implementations of the state-dissemination arrangement described with reference to FIGS. 2 to 7 are by way of example and other implementations are possible, particularly with respect to how the interest of an entity in particular state information is associated with the source(s) of such information.

In certain cases, a resource manager is not required. For example, each resource in a system can be pre-allocated to a specific resource user, each resource storing this allocation so that as soon as the resource becomes available to the system, it can register a suitable state provider with the state-dissemination service to make its allocation to a particular resource user known to the system without having to wait to be allocated to a resource by a resource manager. It is also possible to arrange for a resource to be its own manager allocating itself to resource users as it sees fit (which can include an initial allocation to a predetermined user).

It may also be noted that even where a resource is being managed by a separate resource manager, it does not necessarily always have to accept the allocation instructions received from the manager. Advantageously, the resource can apply a predetermined set of rules to filter the allocation instructions it receives. For example, the resource may refuse to accept an allocation instruction because it has an overriding rule never to accept allocation to the specified resource user; or because it has simultaneously received a conflicting allocation instruction from another resource manager which, according to another rule, has higher priority; or because it is already at the limit of the number of resources to which it can be simultaneously allocated according to a further rule.

Whilst resources are preferably arranged to provide their allocation state information to the state-dissemination service whenever this state information changes, the allocation state information can additionally or alternatively be provided to the state-dissemination service in other circumstances, such as at regular time intervals.

Resource users can also be arranged to take allocation decisions—for example, resource users can be given authority to transfer resources allocated to them to other resource users. This is akin to the resource user that is allocated a resource effectively having an ownership right in the resource including the right of disposition. Such a right of disposition can be exercised directly by the resource user or through an existing resource manager.

Resource users can also be arranged to carry out role allocation between the resources allocated to them. Thus, rather than a resource user asking a resource manager for one database server and four application servers, it can simply ask for five generic servers and then subsequently allocate the roles of database server and application server between the generic servers it is allocated by the resource manager.

It may also be noted that where resource managers learn of the needs of the resource users by resource requests sent by the users to the managers, a resource user is preferably arranged to repeat a resource request periodically until it observes, via the state-dissemination service, that it has been allocated the requested resource. This builds resilience into the system and enables resource managers to drop resource requests if necessary.

Each resource user can be provided with a respective associated resource manager (indeed, a resource user and a manger can be combined in a single entity). In this case, the resource user and associated manager effectively form a combination equivalent to the resource user acting as its own resource manager.

It will be appreciated that the SD servers, the resources, resource users and resource managers described above will typically be implemented using appropriately programmed general purpose program-controlled processors and related hardware devices (such as storage devices and communication devices). However, other implementations are possible. 

1. A system comprising: a plurality of resources each arranged to maintain and provide state information about its allocation to one or more resource users and its identity; a state-dissemination arrangement for disseminating the state information provided by the resources; and at least one state-information observer arranged to receive state information from the state-dissemination arrangement, said at least one state-information observer comprising at least one resource user arranged to use the state information it receives to ascertain which of the resources, if any, have been allocated to it
 2. A system according to claim 1, wherein the said at least one state-information observer further comprises at least one resource manager arranged to receive state information from the state-dissemination arrangement whereby to ascertain the allocation of the resources of interest to the manager.
 3. A system according to claim 1, wherein the state-dissemination arrangement is arranged to deliver the state information provided by all the resources to the or each state-information observer.
 4. A system according to claim 1, wherein the or each state-information observer with an interest in respective particular state information is arranged to register with the state-dissemination arrangement to indicate its interest in that information; the state-dissemination arrangement being arranged to use these registered interests to manage the dissemination of state information.
 5. A system according to claim 4, wherein the or each state-information observer is arranged to register its interest in particular state information by registering a state-information indicator indicative of the state information it wishes to receive, and each resource is arranged to provide state information identified by a state-information identifier; the state-dissemination arrangement including association means for matching state-information identifiers with registered state-information indicators whereby to enable the dissemination of state information to be managed by the state-dissemination arrangement according to the registered interest of the or each state-information observer.
 6. A system according to claim 5, wherein each resource is arranged to register with the state-dissemination arrangement a state-information identifier for state information to be provided by the resource, the association means of the state dissemination arrangement being arranged to match registered state-information identifiers with registered state-information indicators and to store association data associating each registered state-information identifier with data for managing the dissemination of the state information identified by that identifier, the state dissemination arrangement being arranged to use the association data in disseminating said state information.
 7. A system according to claim 6, wherein the state-dissemination arrangement comprises multiple state-dissemination servers arranged to communicate with each other; each resource and the or each state-information observer being operatively associated with a respective said state-dissemination server, and said association data serving to associate each registered state-information identifier with the or each state-dissemination server operatively associated with any said state-information observer that has registered a state-information indicator matching the identifier; each state-dissemination server being arranged to use the association data to disseminate the state information it receives from a resource, to the or each server indicated by the association data as being associated with the identifier with the latter server being arranged to pass on the state information to the or each state-information observer operatively associated with it that has registered a state-information indicator matching the identifier of the state information concerned.
 8. A system according to claim 7, comprising processing nodes interconnected by a communications network; the resources and the said at least one state-information observer being distributed between the processing nodes as part of the latter, and each processing node being provided with a respective said state-dissemination server with the servers being arranged to communicate with each other over the communications network.
 9. A system according to claim 7, wherein each state-dissemination server is arranged to tore registration data indicative, for the or each resource and state-information observer with which it is operatively associated, of any state-information identifiers and indicators registered by the resource or state-information observer concerned.
 10. A system according to claim 7, wherein the association means comprises a respective component forming part of each said state-dissemination server, each such component being arranged to store said association data in respect of the state-information identifiers registered by any resources operatively associated with the server of which the component forms a part.
 11. A system according to claim 7, wherein each said state-dissemination server is arranged to store source data indicative, for the state-information indicators registered by any state-information observer operatively associated with the server, of the or each state-dissemination server that is operatively associated with a said resource registered to provide state information corresponding to said indicators.
 12. A system according to claim 7, wherein the association means comprises a respective state manager forming part of each said state-dissemination server, the state manager of each state-dissemination server comprising a local registry which in turn comprises: a local register arranged to store the association data related to the server, and registration data indicative, for the or each resource and state-information observer operatively associated with the server, of any state-information identifiers and indicators registered by the resource or state-information observer concerned; first update means arranged to register a state-information identifier for a said resource operatively associated with the server, by updating the registration data in the local register accordingly; and second update means arranged to register a state-information indicator for a said state-information observer operatively associated with the server, by updating the registration data accordingly.
 13. A system according to claim 12, wherein the association means further comprises a global registry; the first update means of each local registry being arranged when registering a new state-information identifier to send a first message including that identifier to the global registry, and the second update means of each local registry being arranged when registering a new state-information indicator to send a second message including that indicator to the global registry; the global registry comprising: a global register arranged to store, for each said state-information identifier and indicator, data indicative of the or each state-dissemination server where that identifier or indicator has been registered; first update means responsive to receipt of a said first message to update the global register regarding the identifier in the message, find any matching state-information indicator in the global register, and return to the server that sent the message association-update data indicative of the or each state-dissemination server where a matching indicator was registered; and second update means responsive to receipt of a said second message to update the global register regarding the indicator in the message, find any matching state-information identifier in the global register, and send to the or each state-dissemination server where that identifier was registered, association-update data indicative of the state-dissemination server that sent the second message; each local registry further comprising third update means for updating the said association data held by the local register in response to receipt of said association-update data from the global registry.
 14. A system according to claim 5, wherein each said resource is arranged to register a state-information identifier comprising at least a generic portion the same for all resources, the association means being such as to detect a match between all these identifiers and a state-information indicator comprising this generic portion whereby a state-information observer can register a single state-information indicator to receive state information from all resources.
 15. A system according to claim 5, wherein each said resource of a given type is arranged to register a state-information identifier comprising at least a type-generic portion the same for all resources of said type, the association means being such as to detect a match between all these identifiers and a state-information indicator comprising this type-generic portion whereby a state-information observer can register a single state-information indicator to receive state information from all resources of said type.
 16. A system according to claim 5, wherein the or each resource allocated to a particular said resource user is arranged to register a state-information identifier comprising at least a user-specific portion the same for all these resources, the association means being such as to detect a match between all these identifiers and a state-information indicator comprising this user-specific portion whereby the resource user concerned can register a single state-information indicator to receive state information from all resources allocated to it.
 17. A system according to claim 5, wherein the association means is arranged to detect a match between a said state-information identifier and a state-information indicator only in the event of a complete match over the full extent of both.
 18. A system according to claim 5, wherein the association means is arranged to detect a match between a said state-information identifier and a state-information indicator upon at least a part of the identifier matching at least a part of the identifier
 19. A system according to claim 5, wherein a said at least one state-information observer is arranged to register multiple different state-information indicators.
 20. A system according to claim 5, wherein at least one resource is arranged to register multiple state-information identifiers identifying the same state information.
 21. A system according to claim 5, wherein at least one resource is arranged to register a further state-information identifier identifying additional state information not including the allocation of the resource, and a said at least one state-information observer is arranged to register a further state-information indicator matching said further state-information identifier whereby to receive said additional state information.
 22. A system according to claim 2, wherein the or each resource manager is arranged both to use the state information it receives to ascertain the current allocation of resources of interest to it, and to control the allocation of those resources by sending control messages to the resources.
 23. A system according to claim 22, wherein there are multiple resource managers, these resource managers being arranged to communicate with each other to coordinate the allocation of resources.
 24. A system according to claim 1, wherein each resource is arranged to provide its state information to the state-dissemination arrangement upon a change in the allocation of the resource.
 25. A system according to claim 1, wherein at least one said resource is arranged to store a predetermined initial allocation whereby upon that resource becoming available to the system, it can provide state information including said initial allocation without having to wait to be allocated to a resource user by the system.
 26. A system according to claim 1, wherein at least one said resource is arranged to act as its own manager to dynamically change its allocation to resource users.
 27. A system according to claim 1, wherein at least one said resource is arranged to use a predetermined rule set to filter allocation instructions it receives from other entities of the system.
 28. A system according to claim 1, wherein there are multiple resource users, at least one said resource user being arranged to transfer a resource it no longer requires to another of the resource users by instructing the resource, regarding the change in its allocation, either directly or through a resource manager of the system.
 29. A system according to claim 1, wherein at least one said resource user is arranged to allocate roles to generic resources allocated to it.
 30. A system according to claim 1, wherein at least one said resource user is arranged to act as its own resource manager to dynamically allocate resources to itself.
 31. A system according to claim 1, wherein the state-dissemination arrangement includes communication timing means for monitoring the communication time taken to disseminate information from a resource to the or each state-information observer that wishes to receive state information from it, the communication timing means being arranged to cause the or each such state-information observer to be informed, upon the monitored communication time for disseminating information to them from the resource concerned exceeding a predetermined time value, that state information for the resource is no longer available.
 32. A system according to claim 31, wherein the state-dissemination arrangement further includes partition means for identifying non-overlapping collections where each collection comprises at least one resource and at least one state-information observer between all of which state information can be disseminated within said predetermined time limit as monitored by the communication timing means; the said at least one state-information observer of a said collection being arranged to take account of state information only from resources within the same collection; and the state-dissemination arrangement being further arranged to inform the state-information observers of a collection of any disruption to collection membership whereby each such state-information observer knows that it cannot rely upon the receipt by interested state-information observers of the collection, of any item of state information which the state-information observer itself has received within an immediately preceding time period of duration corresponding to twice said predetermined time limit.
 33. A system according to claim 11, wherein each state-dissemination server includes communication timing means arranged to monitor whether the server can still communicate within a communication time limit with other state-dissemination servers and, upon this limit not being met in respect of any such other server, to use the source data of the server of which it forms a part to inform any operatively-associated state-information observer that had registered to receive state information coming from a said resource associated with the server with which communication is out of time, that this state information is no longer available.
 34. A system according to claim 33, wherein each state-dissemination server further includes partition means for identifying, in cooperation with the partition means of other servers, a collection of servers, including itself, between all of which state information can be disseminated within said predetermined time limit as monitored by the communication timing means of the servers, the partition means being such that each state-dissemination server only belongs to one collection; the state-dissemination arrangement being arranged to provide the state-information observers of a said collection with state information only from resources within the same collection; and each state-dissemination server of a collection being further arranged to inform its operatively associated state-information observers of any disruption to collection membership whereby each such state-information observer knows that it cannot rely upon the receipt by interested state-information observers of the collection, of any item of state information which the state-information observer itself has received within an immediately preceding time period of duration corresponding to twice said predetermined time limit.
 35. A system according to claim 1, wherein the state-dissemination arrangement is so arranged that all components receiving a particular item of state information from it can, within a defined time, rely on all interested components having received the information. 